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Abstract —With the explosive growth of mobile data demand, 
there has been an increasing interest in deploying small cells 
of higher frequency bands underlying the conventional homo¬ 
geneous macrocell network, which is usually referred to as 
heterogeneous cellular networks, to significantly boost the overall 
network capacity. With vast amounts of spectrum available in 
the millimeter wave (mmWave) band, small cells at mmWave 
frequencies are able to provide multi-gigabit access data rates, 
while the wireless backhaul in the mmWave band is emerging 
as a cost-effective solution to provide high backhaul capacity 
to connect access points (APs) of the small cells. In order to 
operate the mobile network optimally, it is necessary to jointly 
design the radio access and backhaul networks. Meanwhile, 
direct transmissions between devices should also be considered 
to improve system performance and enhance the user experience. 
In this paper, we propose a joint transmission scheduling scheme 
for the radio access and backhaul of small cells in the mmWave 
band, termed D2DMAC, where a path selection criterion is 
designed to enable device-to-device transmissions for perfor¬ 
mance improvement. In D2DMAC, a concurrent transmission 
scheduling algorithm is proposed to fully exploit spatial reuse 
in mmWave networks. Through extensive simulations under 
various traffic patterns and user deployments, we demonstrate 
D2DMAC achieves near-optimal performance in some cases, and 
outperforms other protocols significantly in terms of delay and 
throughput. Furthermore, we also analyze the impact of path 
selection on the performance improvement of D2DMAC under 
different selected parameters. 

Index Terms —Heterogeneous cellular networks, small cells, 
MAC scheduling, millimeter wave communications, 60 GHz, 
device-to-device. 


I. Introduction 

Mobile traffic demand is increasing rapidly, and a 1000- 
fold demand increase by 2020 is predicted by some industry 
and academic experts ID, 0- In order to meet this explosive 
growth demand and enhance the mobile network capacity, 
there has been an increasing interest in deploying small cells 
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underlying the conventional homogeneous macrocell network 
0. This new network deployment is usually referred to as 
heterogeneous cellular networks (HCNs). However, reducing 
the radii of small cells in the carrier frequencies employed 
in today’s cellular systems to reap the spatial reuse benefits 
is fundamentally limited by interference constraints 0. By 
using higher frequency bands, such as the millimeter wave 
(mmWave) bands between 30 and 300 GHz, and bringing 
the network closer to users by a dense deployment of small 
cells 0,0, HCNs can significantly boost the overall network 
capacity due to less interference and higher data rates. With 
huge bandwidth available in the mmWave band, small cells at 
mmWave frequencies are able to provide multi-gigabit com¬ 
munication services, such as uncompressed and high-definition 
video transmission, high speed Internet access, and wireless 
gigabit Ethernet for laptops and desktops, and have attracted 
considerable interest from academia, industry, and standards 
bodies 0, Q. Rapid development in complementary metal- 
oxide-semiconductor radio frequency integrated circuits 0, 
0 , m accelerates the industrialization of mmWave com¬ 
munications. Meanwhile, standards, such as ECMA-387 CD 
, IEEE 802.15.3c CD, and IEEE 802.11ad HU- have been 
defined for indoor wireless personal area networks (WPAN) 
or wireless local area networks (WLAN). 

Unlike existing communication systems of macrocells using 
lower carrier frequencies (e.g., from 900 MHz to 5 GHz), 
small cells in the mmWave band suffer from high propagation 
loss. The free space propagation loss at 60 GHz band is 28 
decibels (dB) more than that at 2.4 GHz lfl4l . To combat severe 
channel attenuation, high gain directional antennas are utilized 
at both the transmitter and receiver to achieve directional 
transmission CD, ED, G3. With a small wavelength, it is 
feasible to produce low-cost and compact on-chip and in¬ 
package antennas in the mmWave band ED, ED. Under 
directional transmission, the third party nodes cannot perform 
carrier sense as in IEEE 802.11 to avoid contention with 
current transmission, which is referred to as the deafness 
problem. On the other hand, there is less interference among 
links, and concurrent transmissions (spatial reuse) can be ex¬ 
ploited to greatly improve network capacity. Due to directivity 
and high propagation loss, the interference among cells is 
minimal, and total capacity of small cells scales with the 
number of cells in a given region 0 . On the other hand, as 
the data transmission rates of radio access networks increase 
dramatically, providing backhaul to transport data to (from) 
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the gateway in the core network for small cells becomes 
a significant challenge problem 0- Although fiber based 
backhaul offers larger bandwidth to meet this requirement, it 
is costly, inflexible and time-consuming to connect the densely 
deployed small cells with high connectivity. In contrast, high 
speed wireless backhaul is more cost-effective, flexible, and 
easier to deploy 0, fl9l . |20l , lf2TI . Wireless backhaul in 
mm Wave bands, especially 60 GHz band, can be a promising 
backhaul solution for small cells |20) . The wide bandwidth 
enables several-Gbps data rate even with simple modulation 
schemes such as OOK, BPSK and FSK. 



Fig. 1. Dense deployment of small cells in the 60 GHz band underlying the 
macrocell network. 

In Fig. |T] we present a typical scenario for dense deployment 
of small cells in the 60 GHz band underlying the macrocell 
cellular network, which forms HCNs. In HCNs, the macrocell 
is coupled with the small cells in the mmWave band to some 
extent ll22ll . and the macrocell network is allowed to have 
certain visibility and radio resource management control over 
the small cells |23| . Users can communicate via the current 
macrocell as well as the small cells. In the small cells, mobile 
users are associated with access points (APs), and APs are 
connected through wireless backhaul links. Some APs are 
connected to the Internet via a direct and high speed wired 
connection, which are called gateways. In this targeted small 
cells system, both the radio access and backhaul links are in 
the 60 GHz band, which can provide high data rate for both 
radio access and backhaul, and also reduce implementation 
complexity and deployment cost. Considering the fundamental 
differences between mmWave communications and existing 
other communication systems using lower carrier frequencies, 
scheduling over both the radio access and backhaul networks 
is a difficult problem and should be carefully designed. There 
are two aspects of challenges. In the first aspect, it is necessary 
to consider both radio access and backhaul networks jointly 
l24l . On one hand, co-design of scheduling for transmission 
over radio access and backhaul networks can maximize the 
spatial reuse across radio access and backhaul networks. 
Consequently, network capacity and performance can be im¬ 
proved significantly. On the other hand, joint scheduling for 


transmission over radio access and backhaul networks can also 
manage the interference both within each cell and among cells 
efficiently. In the second aspect, under dense deployment, there 
is a high probability that two devices within both the same cell 
and different cells are located near to each other. In this case, 
user devices can transmit data to each other over direct links 
using the small cell resources, instead of through APs. Thus, 
most of context-aware applications that involve discovering 
and communicating with nearby devices, including the popular 
content downloading, can benefit from the device-to-device 
communications by reducing the communication cost since 
it enables physical-proximity communication, which saves 
power while improving the spectral efficiency ll25ll . Therefore, 
the potential of device-to-device communications to enhance 
the network performance should also be considered in the 
joint scheduling problem over both radio access and backhaul 
networks. 

Aiming to address the above challenges, in this paper, 
we propose a joint transmission scheduling scheme, termed 
D2DMAC, for small cells in the mmWave band. In D2DMAC, 
joint scheduling over radio access and backhaul networks 
is considered, and direct transmissions between devices are 
further exploited to improve network performance in terms of 
throughput and delay. If the direct link between the sender 
and receiver of one flow has high channel quality, the direct 
transmission will be adopted instead of transmission through 
the backhaul network. The contributions of this paper are 
three-fold, which are summarized as follows. 

« We formulate the joint scheduling problem over both 
the radio access and backhaul networks with the direct 
transmissions between devices considered into a mixed 
integer linear program (MILP), i.e., to minimize the 
number of time slots to accommodate the traffic demand 
of all flows. Concurrent transmissions, i.e., spatial reuse, 
are explicitly considered in this formulated problem. 

« We propose an efficient scheduling scheme termed 
D2DMAC, which consists of a path selection criterion 
and a transmission scheduling algorithm to solve the for¬ 
mulated problem. The priority of device-to-device trans¬ 
mission is characterized by the path selection parameter 
of the path selection criterion, while concurrent transmis¬ 
sions are fully exploited in the transmission scheduling 
algorithm to maximize the spatial reuse gain. 

• We analyze the concurrent transmission conditions in 
D2DMAC, and derive a sufficient condition for each link 
to support concurrent transmissions with other links, i.e. 
, the distances between the interference sources and the 
receiver of this link should be larger than or equal to its 
interference radius. We also analyze the interference radii 
under different modulation and coding schemes, different 
path loss exponents, different link lengths, and different 
transmission power. 

• Through extensive simulations under different traffic pat¬ 
terns and user deployments, we demonstrate our proposed 
D2DMAC achieves near-optimal performance in some 
cases in terms of delay and throughput by comparison 
with the optimal solution, and it outperforms the other 
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existing related protocols significantly. Furthermore, we 
analyze the impact of path selection parameter on perfor¬ 
mance improvement. 

The paper is organized in the following way: Section [TT] 
presents related work on scheduling in the mmWave band. 
Section [III] introduces the system model and illustrates our 
basic idea by an example. Section ITVl formulates the optimal 
joint scheduling problem of access and backhaul with device- 
to-device transmissions considered as an MILP. Section [V] 
presents our proposed scheme, D2DMAC, which includes the 
path selection criterion and transmission scheduling algorithm. 
Section [Vi] presents the conditions for concurrent transmis¬ 
sions and gives a performance analysis. Section IVTII shows the 
simulation results of D2DMAC under various traffic patterns 
and different path selection parameters, and the comparison 
with the optimal solution and other protocols. Finally, we 
conclude this paper in Section IVIIII 

II. Related Work 

Recently, some related work on scheduling for small cells in 
the mmWave band has been published. Since the standards of 
ECMA-387 03) and IEEE 802.15.3c Q2) adopt TDMA, some 
work is also based on TDMA (26), (27), (28), (29), (30), (ID . 
(32) . In two protocols based on IEEE 802.15.3c (28l . (29), 
multiple links are scheduled to communicate in the same slot 
if the multi-user interference (MUI) between them is below 
a specific threshold. Qiao et al. (30.) proposed a concurrent 
transmission scheduling algorithm to maximize the number 
of flows scheduled with the quality of service requirement of 
each flow satisfied. Cai et al. ED introduced the concept of 
exclusive region (ER) to support concurrent transmissions, and 
derived the conditions that concurrent transmissions always 
outperform TDMA. Qiao et al. (32) proposed a multi-hop 
concurrent transmission scheme (MHCT) to address the link 
outage problem (blockage) and combat large path loss to 
improve flow throughput. For bursty traffic, TDMA based 
protocols face the problem that the medium access time for a 
flow is often highly unpredictable, which will result in over¬ 
allocated medium access time for some users while under¬ 
allocated medium access time for others. Besides, TDMA 
based protocols may also have high control overhead for 
medium reservation. 

There is some other work based on a central coordinator to 
coordinate the transmissions in WPANs am am, m, Ei- 
Gong et al. f33) proposed a directional CSMA/CA protocol, 
which exploits virtual carrier sensing, and depends on the 
piconet coordinator (PNC) to distribute the network allocation 
vector (NAV) information. It mainly focuses on solving the 
deafness problem and does not exploit the spatial reuse fully. 
Recently, Son et al. [18) proposed a frame based directional 
MAC protocol (FDMAC). The high efficiency of FDMAC is 
achieved by amortizing the scheduling overhead over multiple 
concurrent transmissions in a row. The core of FDMAC is the 
Greedy Coloring algorithm, which fully exploits spatial reuse 
and greatly improves the network throughput compared with 
MRDMAC (17) and memory-guided directional MAC (MD- 
MAC) (34) . FDMAC also has a good fairness performance and 


low complexity. Singh et al. EO proposed a multihop relay 
directional MAC protocol (MRDMAC), which overcomes the 
deafness problem by PNC’s weighted round robin scheduling. 
In MRDMAC, if a wireless terminal (WT) is lost due to 
blockage, the access point (AP) will choose a live WT to 
act as a relay to the lost WT. Since most transmissions are 
through the PNC, MRDMAC does not consider the spatial 
reuse. Chen et al. (35) proposed a directional cooperative 
MAC protocol, D-CoopMAC, to coordinate the uplink channel 
access in an IEEE 802.llad WLAN. D-CoopMAC selects a 
relay for a direct link, and when the two-hop link outperforms 
the direct link, the latter will be replaced by the former. 
Spatial reuse is also not considered in D-CoopMAC since most 
transmissions go through the AP. All the work above considers 
scheduling in the access network, and does not consider the 
backhaul network. For the small cell networks in the mmWave 
band, since the backhaul network becomes the bottleneck, the 
backhaul and access networks should be considered jointly to 
promote the spatial reuse and perform efficient interference 
management both within and among cells. 

On the other hand, there is also some related work on 
wireless backhaul networks. Lebedev et al. [20j identified ad¬ 
vantages of the 60 GHz band for short-range mobile backhaul 
through feasibility analysis and comparison with the E-band 
technology. Islam et al. (3) performed the joint cost optimal 
aggregator node placement, power allocation, channel schedul¬ 
ing and routing to optimize the wireless backhaul network in 
mmWave bands. Bojic et al. ED discussed advanced wireless 
and optical technologies for small-cell mobile backhaul with 
dynamic software-defined management in 60 GHz band and 
E-band. Bernardos et al. (24) discussed the main challenges 
to design the backhaul and radio access network jointly in 
a cloud-based mobile network. Design ideas involving the 
physical layer, the MAC layer, and the network layer are 
proposed. Most of the work on the backhaul above does not 
consider the access network, and fails to give a joint design of 
scheduling over the access and backhaul networks. Besides, 
direct transmissions between devices are also not exploited 
to improve network performance in the work above. To the 
best of our knowledge, we are the first to consider the trans¬ 
mission scheduling over both access and backhaul networks 
jointly, and also exploit the device-to-device transmission for 
performance improvement. 

As regards small cells in the mmWave band, most work 
focused on using bands such as 28GHz, 38GHz and 73GHz to 
attain ranges of the order of 200m or even more (36) . limZhu 
et al. SI proposed a 60GHz picocell architecture to augment 
existing LTE networks for significant increase in capacity. The 
feasibility of this architecture is investigated by characterizing 
range, attenuation due to reflections, sensitivity to movement 
and blockage, and interference in typical urban environments. 
As in Ref. ffl, the oxygen absorption that peaks at 60GHz is 
not a barrier to high-capacity 60GHz outdoor picocells, and 
can even promote spatial reuse due to less interference from 
faraway base stations. 




4 


III. System Overview 

A. System Model 

We consider the scenario that multiple small cells are 
deployed. In each small cell, there are several wireless nodes 
(WNs) and an access point (AP), which synchronizes the 
clocks of WNs and provides access services within the cell. 
The APs form a wireless backhaul network in the mmWave 
band, which should be stationary. Also, the backhaul links 
between APs are able to be optimized in order to achieve 
high channel quality and reduce interference. There are some 
APs connected to the Internet via a direct and high speed 
wired connection, which are called gateways. The remaining 
APs should communicate with the gateways to send (receive) 
data to (from) Internet. The backhaul network can form 
arbitrary topology like ring topology, star topology, or tree 
topology (38) , where the backhaul paths between APs should 
be optimized to maximize backhaul efficiency. To overcome 
huge attenuation in the mmWave band, both the WNs and APs 
achieve directional transmissions with electronically steerable 
directional antennas by the beamforming techniques EUM). 
In such a system of small cells networks, we schedule the 
transmission of the access and backhaul networks jointly, 
which apparently differs from the scheduling problem of the 
ad-hoc network. 

In this paper, we adopt centralized control. On one hand, 
distributed control does not scale well 02, and the latency 
will increase significantly with the number of APs, which is 
unsuitable for mmWave communication systems where a time 
slot only lasts for a few microseconds. Moreover, distributed 
control is difficult to achieve intelligent control mechanisms 
required for the complex operational environments that involve 
dynamic behaviors of accessing users and temporal variations 
of the communication links. On the other hand, centralized 
control is able to achieve flexible control in an optimized 
manner ED- There is a centralized controller in the network, 
which usually resides on a gateway and is similar to the 
central coordinator in IEEE 802.1 lad fl3l . The system is 
partitioned into non-overlapping time slots of equal length, 
and the controller synchronizes the clocks of APs. Then the 
clocks of WNs are synchronized by their corresponding APs. 

There is a bootstrapping program in the system, by which 
the central controller knows the up-to-date network topology 
and the location information of APs and WNs. The network 
topology can be obtained by the neighbor discovery schemes 
in E2, (43l . 04) . 03) , ESI . Location information of nodes 
can be obtained based on wireless channel signatures, such as 
angle of arrival (AoA), time difference of arrival (TDoA), or 
the received signal strength (RSS) 07), 08], 09), (50), l5l) . 
In our system, the bootstrapping program adopts the direct 
discovery scheme to discover the network topology 02]|. In the 
direct discovery scheme, a node is in the transmitting or receiv¬ 
ing state at the beginning of each time slot. In the transmitting 
state, a node transmits a broadcast packet with its identity in 
a randomly chosen direction. In the receiving state, a node 
listens for broadcast packets from a randomly chosen direction. 
If a collision happens, the node fails to discover any neighbor; 
otherwise, if the transmitter is unknown, the receiver discovers 


a new neighbor by recording the angle of arrival (AOA) and 
the transmitter’s identity. After the direct discovery, the nodes 
report discovered neighbors to the APs, and then APs report 
obtained topology information to the central controller. At the 
same time, the APs obtain the location information of WNs by 
a maximum-likelihood (ML) classifier based on changes in the 
second-order statistics and sparsity patterns of the beamspace 
multiple input multiple output (MIMO) channel matrix (47). 
Besides, in the heterogeneous cellular networks scenario, the 
location information can also be obtained by the cellular 
geolocation from cellular base stations in 08) if the macrocell 
is coupled with the small cells to some extent. Then the central 
controller obtains the location information of nodes from the 
APs. With relatively low mobility, the network topology and 
location information will be updated periodically. 

There are two kinds of flows transmitted in the network, 
flows between WNs and flows from or to the Internet (gate¬ 
way). Specifically, we assume there are N flows with traffic 
transmission demand in the network. For flow i, its traffic 
demand is denoted as di. The traffic demand vector for all 
flows is denoted as d, a 1 x N matrix whose ith element 
is di. For each flow, there are two possible transmission 
paths in our system, the ordinary path and direct path. The 
ordinary path is the transmission path through APs, which 
may includes the access link from the source to its associated 
AP, the backhaul path from the source’s associated AP to the 
destination’s associated AP or the gateway, and the access link 
from the destination’s associated AP to the destination. We 
also assume the backhaul path from the source’s associated 
AP to the destination’s associated AP across the backhaul 
network is pre-determined by some criterion, such as the 
criterion of minimum number of hops, or maximum minimum 
transmission rate on the backhaul path. For the j th hop link of 
the ordinary path for flow i, we denote its sender as s t j and 
receiver as r,; , and denote this link as r,; 7 ). The direct 
path is the direct transmission path from the source to the 
destination, which does not pass through the backhaul network 
and only has one hop. We denote the direct link of flow i as 
(sf,rf), where sf is the source, and rf is the destination. 

For each flow i, its transmission rate vector on the ordinary 
path is denoted as c\, where each element cfj represents the 
transmission rate of the yth hop on the ordinary path. The 
maximum number of hops of the ordinary paths is denoted 
by H max . We also denote the transmission rate matrix for the 
ordinary paths of flows as C b , where the /th row of C fa is cf 
and C b is an N x matrix. The transmission rate of the 

direct path for flow i is denoted as cf, and the transmission 
rate vector for the direct paths of flows is denoted as c d , a 
1 x N matrix whose ith element is cf. 

In the LOS case, the achievable maximum transmission 
rates of links can be estimated according to Shannon’s chan¬ 
nel capacity 11301 . In our system, the transmission rates on 
the ordinary paths and direct paths can be obtained via a 
channel transmission rate measurement procedure (55). In this 
procedure, the transmitter of each link transmits measurement 
packets to the receiver first. Then with the measured signal to 
noise ratio (SNR) of received packets, the receiver estimates 
the achievable transmission rate, and appropriate modulation 
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and coding scheme by the correspondence table about SNR 
and modulation and coding scheme. Under low user mobility, 
the procedure is usually executed periodically. Besides, since 
the APs are not mobile and positioned high above to avoid 
blockage a, the transmission rates of backhaul links can be 
assumed fixed for a relatively long period of time. 

With directional transmissions, there is less interference 
between links. Under low multi-user interference (MUI) lf30l . 
concurrent transmissions can be supported. In the system, all 
nodes are assumed to be half-duplex, and two adjacent links 
cannot be scheduled concurrently since each node has at most 
one connection with one neighbor d). For two nonadjacent 
links, we adopt the interference model in f30l and (56). For 
link ( Si,Ti ) and link ( Sj,rj ), the received power from s, to 
77 can be calculated as 

Pr j ,s i =fs i ,r j k 0 P t l Si r-\ ( 1 ) 


where P t is the transmission power that is fixed, ko = 
10 PL ( d o)/ 10 is the constant scaling factor corresponding to 
the reference path loss PL(do) with do equal to 1 m, l Sir . is 
the distance between node s,; and node 77 , and 7 is the path 
loss exponent ED- fsi,rj indicates whether s* and 77 direct 
their beams towards each other. If it is, f Si , rj = 1; otherwise, 
fsi,rj = 0. In other words, f Si , rj = 0 if and only if any 
transmitter is outside the beamwidth of the other receiver or 
does not direct its beam to the other receiver if it is within the 
beamwidth of the other receiver (57). Thus, the SINR at 77 , 
denoted by SINR Sjrj , can be calculated as 


SINR Sjr . 


k 0 Ptl Sjr3 ~ 7 

WNo+p^fsi^koPtls^- 1 ’ 


( 2 ) 


where p is the MUI factor related to the cross correlation 
of signals from different links, W is the bandwidth, and 
Nq is the onesided power spectra density of white Gaussian 
noise (30) . For link (*,, 77 ), the minimum SINR to support its 
transmission rate c SiyTi is denoted as MS(c Siri ). Therefore, 
concurrent transmissions can be supported if the SINR of each 
link ( Si,ri ) is larger than or equal to MS{c Siri ). 


B. D2DMAC Operation Overview 

The operation of D2DMAC is illustrated in Fig. [3] 
D2DMAC is a frame based MAC protocol as FDMAC |fT 8 ) . 
Each frame consists of a scheduling phase and a transmission 
phase, and the scheduling overhead in the scheduling phase 
can be amortized over multiple concurrent transmissions in 
the transmission phase as in m. In the scheduling phase, 
after the WNs of each cell steer their antennas towards their 
associated AP, each AP polls its associated WNs successively 
for their traffic demand. Then each AP reports the traffic 
demand of its associated WNs to the central controller on the 
gateway through the backhaul network. We denote the time 
for the central controller to collect the traffic demand as t po ii- 
Based on the transmission rates of links, the central controller 
computes a schedule to accommodate the traffic demand of 
all flows, which takes time t sc h■ Then the central controller 
pushes the schedule to the APs through the backhaul network. 


Afterwards, each AP pushes the schedule to its associated 
WNs. We denote the time for the central controller to push the 
schedule to the APs and WNs as t pus h . In the transmission 
phase, WNs and APs communicate with each other following 
the schedule until the traffic demand of all flows is accommo¬ 
dated. The transmission phase consists of multiple stages, and 
in each stage, multiple links are activated simultaneously for 
concurrent transmissions. In the schedule computation, first 
the transmission path should be selected optimally between 
the direct path and ordinary path for each flow. Then, the 
schedule should accommodate the traffic demand of flows with 
a minimum number of time slots, which indicates concurrent 
transmissions (spatial reuse) should be fully exploited. 



-► Backahul -► Access -► Device to Device 


Fig. 2. An example of D2DMAC with three cells. 


C. Problem Overview 

To maximize the transmission efficiency in the above intro¬ 
duced operation procedure, the traffic demand of flows should 
be accommodated with a minimum number of time slots in 
the transmission phase. To achieve this, first we should unleash 
the potential of spatial reuse to enable as many link to transmit 
concurrently as possible. Meanwhile, for flows with a direct 
link of high quality, we prefer the direct path to the ordinary 
path. In this case, we should enable the direct transmission 
from the source to the destination to improve flow and network 
throughput. 

Now, we present an example to illustrate the basic idea of 
D 2D MAC. In Fig. 0 there are three small cells. In cell 1, 
node C and D are associated with API; in cell 2, node A 
is associated with AP2; in cell 3, node B is associated with 
AP3. There are four flows in the network, A —> B, B —>• C, 
gateway —» B, and D —r gateway. The traffic demands of A 
A B, B -> C, gateway —>• B, and D —> gateway are 5, 6 , 
7, and 8 , respectively, and thus d = [5 6 7 8 ]. Numerically, 
they are equal to the number of packets to be transmitted, and 
the packet length is fixed. After the channel transmission rate 
measurement procedure, the obtained transmission rate matrix 
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Fig. 3. Time-line illustration of D2DMAC operation. 


for the ordinary paths of flows is 

/ 2 3 

, 2 4 

C fc = 

4 2 

\0 0 

which indicates the transmission rates of link A —> AP2, 
AP2 —> AP3, and AP3 —>- B are 2, 3, and 2 respectively. 
Numerically, the transmission rates are equal to the number 
of packets these links can transmit in one time slot. Since 
the flow from D to the gateway (API) does not need to go 
through the backhaul network, the fourth row of C b is set to 
0. The transmission rate vector for the direct paths of flows is 
c“ = [1 2 3 3], and it indicates the direct link of A —» B can 
transmit one packet in one time slot; the direct link API —>• B 
can transmit three packets in one time slot. For each flow, first 
we should select optimal transmission path between its direct 
path and ordinary path. Then the optimal schedule should 
accommodate the traffic demand of flows with a minimum 
number of time slots. Thus concurrent transmission should 
be fully exploited in the schedule. Due to the half-duplex 
assumption, adjacent links cannot be scheduled concurrently. 
Due to the inherent order of transmissions, preceding hops on 
each path should be scheduled ahead of hops behind. Besides, 
the SINR of each link scheduled for concurrent transmissions 
in a stage should be able to support its transmission rate. 

If we select flow A ^ B to be transmitted through its 
ordinary path, and three other flows to be transmitted through 
their direct paths, we can obtain a schedule to accommodate 
the traffic demand of four flows. The schedule consists of three 
stages, and is already illustrated in Fig. [3] In Fig. [2] we plot 
the links that are activated in the schedule of Fig. 2, and their 
transmission rates are labeled above these links. In the first 
stage, access links API —> B and A —>• AP2 transmit for three 
time slots; in the second stage, backhaul link AP2 —>• AP3 
transmits for two time slots, direct link B —>• C for three time 
slots, and access link D —>• API for three time slots; in the 
third stage, access link AP3 —>• B transmits for three time 
slots. In each stage, the SINR of each link is able to support 
its transmission rate. This schedule needs nine time slots to 
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2 

0 
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accommodate the traffic demand of four flows. Conversely, 
if the traffic of flow A —>• B is transmitted through the direct 
path, this flow needs five time slots to be accommodated. Since 
direct link B —>• C, access link API — y B, and direct link A 
—> B have common node B, i.e., are adjacent, they cannot 
transmit concurrently in one stage. In this case, it needs at least 
eleven time slots to accommodate the traffic demand of these 
three flows. Therefore, the selection of transmission paths for 
flows has a significant impact on the efficiency of scheduling. 
Besides, concurrent transmission scheduling also needs to be 
optimized to improve transmission efficiency. 

IV. Problem Formulation And Analysis 
A. Problem Formulation 

We denote a schedule as S, and assume it has K stages. 
In each stage of a schedule, multiple links are scheduled to 
transmit concurrently. For each flow i, we define a binary 
variable aq to indicate whether the direct link of flow i is 
scheduled to transmit in the fcth stage. If it is, aq = 1; 
otherwise, a k = 0. We also define a binary variable b k j to 
indicate whether the jth hop of the ordinary path of flow i is 
scheduled to transmit in the fcth stage. In the problem, we only 
consider flows that have at least one unblocked path, the direct 
path or the ordinary path. The number of time slots of the fcth 
stage is denoted as S k . For each flow i, we define the number 
of hops of its ordinary path as its hop number Hi, if flow i 
does not have the ordinary path as flow D —> gateway in Fig. 
|2], Hi will be set to 1. For any two links (s*, r**) and ( Sj,rj ), 
we define a binary variable I(si, ri, Sj,rj) to indicate whether 
these two links are adjacent. If they are, I(si,Ti,Sj,rj) = 1; 
otherwise, I(si,ri,Sj,rj) = 0. In a schedule, if a link is 
scheduled in one stage, it will transmit as many packets as 
possible until its traffic demand is cleared. Then, the link will 
not be active in the remaining slots of this stage. 

Given the traffic demand of flows, to maximize transmission 
efficiency, we should accommodate the traffic demand with 
a minimum number of time slots COD. The total number of 

K 

time slots of a schedule is ^ S k . Now, we analyze the system 

fc= l 

constraints for a schedule. First, we define a binary variable 
hij to indicate whether flow i has traffic to transmit and j 
exceeds its hop number Hi, which can be expressed as 


— 


1 , if di > 0 & j < Hi, y . . 
0 , otherwise. 


( 4 ) 


In a schedule, the traffic demand of each flow i should be 
accommodated no matter which path its traffic is transmitted 
through, which can be expressed as 


K 

Yj,S k b k ij c%+h ij 5 k a k 4) 

fc =i 

To avoid frequent beamforming or steering, each link can be 
activated at most once in a schedule 111 81 . Besides, we assume 
the traffic of each flow can only be transmitted through its 
direct path or its ordinary path. If its traffic is scheduled on 


> di, if di > 0 & j < Hi, y . . 
= 0, otherwise. 
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the ordinary path, each hop of its ordinary path should be 
activated once in one stage. If its traffic is scheduled on the 
direct path, its direct link should be activated once in one stage. 
Thus, the formulated constraint can be expressed as follows. 


K 

J2(bij+h^) 

k =1 

Adjacent links cannot be scheduled concurrently in the 
same stage due to the half-duplex assumption, which can be 
formulated as follows. 


= 1, if di > 0 & j < Hi, y . . 
= 0, otherwise. ’ ^ 


bjj T b uv f 1 . if I^Sij ,Tij, S uv , T uv ) — 1 , V i, j , U, V , k , 

(7) 


a u + a v < 1, if I(s u ,r u ,s v ,r v ) = 1; Vu,u,fc; 


( 8 ) 


a k + b k v <l, if I{Si,rf,s uv ,r uv ) = 1. V i,u,v,k. 

(9) 

Due to the inherent order of transmissions on the ordinary 
path of flow i, links on the same path cannot be scheduled 
concurrently in the same stage, which is formulated as 

Hi 

£>&■< 1- Vt,fc. (10) 

7=1 


Besides, the jth hop link of the ordinary path of flow i should 
be scheduled ahead of the (j + l)th hop link, which can be 
formulated as 


E b ij > E b i U+1) , if Hi > 1; 

k V i, j = 1 ~ (Hi — 1), K* = 1 


K. 


( 11 ) 


This constraint is a group of constraints since K* varies from 
1 to K. 

To enable concurrent transmissions, the SINR of each link 
in the same stage should be able to support its transmission 
rate, which is formulated for links both on the direct path and 
ordinary path as follows. 


WN 0 +pJ2 J2 /s u „,ry 7 +pX) 

> MS(c b i:j ) x ii|; V i,j, k; 


f.d 


,a^k 0 Ptl s d r .. 7 

j p s p’ r ij 


( 12 ) 


fco Ptl a d t 

WN 0 +pJ2I2 f s r db* v k 0 Ptl a r d~' , +pY.f s d r daqk 0 P t l sdrd -'i 

u v u ’ i v ’ i q <2 ’ i <1 ’ i 

> MS(Ci) x a k . V i,k. 

(13) 

Therefore, the problem of optimal scheduling (PI) is for¬ 
mulated as follows. 


b k - € 
y 


{0,1}, if di > 0 & j < Hi 
{0}, otherwise; 


f {0,1}, if di > 0, 
1 { {0}, otherwise; 

Constraints (l4ljl3]>. 


V i,k 


(15) 

(16) 


In the above formulated problem, constraint (IT5l) indicates 
b k j is a binary variable and is set to 0 if flow i does not have 
traffic demand, or j exceeds Hi. Constraint (IT6l) indicates «■' 
is a binary variable and is set to 0 if flow i does not have 
traffic demand. 


B. Problem Reformulation 

In Problem PI, we can observe that constraints ©, £□>, 
and (fHl i have second-order terms and thus are nonlinear 
constraints. Thus, problem PI is a mixed integer nonlinear 
program (MINLP), which is generally NP-hard. For these 
second-order terms, we adopt a relaxation technique, the 
Reformulation-Linearization Technique (RLT) (58l . l59l to 
linearize them. RLT can produce tight linear programming 
relaxations for an underlying nonlinear and non-convex poly¬ 
nomial programing problem. RLT applies a variable substi¬ 
tution for each nonlinear term in the problem to linearize 
the objective function and the constraints. Nonlinear implied 
constraints for each substitution variable are also generated as 
the products of bounding terms of the decision variables, up 
to a suitable order. 

For the second order terms, 8 k b k ^ and S k a k , in constraint 
we define two substitution variables, u k j = 5 k b k } and 
v k = S k a k as in fT8l . We also define 

<5max = . max { [di/41 , \di/cf\ \c b tj fO,cf ^ 0} (17) 

V 1,3=1 ~Hi J J 

as the maximum possible number of time slots of a stage. Then 
we know 0 < 5 k < <5 m ax- Since 0 < b k j < 1, we can obtain 
the RLT bound-factor product constraints for as follows 

m. 


’ 4 > 

dmaxdy - U ij > 

8 k -u%> 0, 

. d max - 5 k - 8 max b k j + 4 > 0; 


for all i,j,k. 


(18) 


For v k , we can obtain its RLT bound-factor product con¬ 
straints in a similar way. The RLT procedures for the second- 
order terms in constraints (fl2l > and (IT3l > are similar and thus 
omitted. After substituting these substitution variables into 
their responding constraints, we can reformulate problem PI 
into a mixed integer linear program (MILP) as follows. 


K 

min VE fc , 

k=l 

S. t. 


(14) 


K 

min S k , 

k=l 


s. t. 


(19) 






K 


k=l 



> di , if di > 0 k, j < Hi, 
= 0, otherwise; 


V i,j 

( 20 ) 


Constraint ( ITSl ) and generated RLT bound-factor product 
constraints for v!?. 

Constraints (fl2l) and ( ITTt i after the RLT procedure and 


generated RLT bound-factor product constraints. 
Constraints (Q}. (IbllTO) and (fl5l - H6] l. 

We consider the example in Fig. [2] The traffic demand 
vector d, the transmission rate matrix for the ordinary paths 
as C b , and the transmission rate vector for the direct paths c d 
are the same as those in Section IIII-CI We also assume for 
any two nonadjacent links (sj,rj) and ( sj,Tj ), f Si , r , is equal 
to 0. Then we solve problem ( ITbl i using an open-source MILP 
solver, YALMIP Ii60l . The path selection scheme is that flow 
A —>• B goes through its ordinary path, and the other three 
flows go through their direct paths. The optimal schedule has 
three stages, and is already illustrated in Fig. [3] 

The formulated MILP is NP-hard. The number of con¬ 
straints is 0((FH max ) 2 K), where H max is the maximum 
number of hops of ordinary paths of flows. The number of 
decision variables is 0((FH max ) 2 K). Using the branch-and- 
bound algorithm will take significantly long computation time 
ED, and it is unacceptable for practical mm Wave cells where 
the duration of one time slot is only a few microseconds. 
Therefore, we need heuristic algorithms with low computa¬ 
tional complexity to obtain near-optimal solutions in practice. 


V. The D2DMAC Scheme 

To solve the formulated MILP, we should consider two 
steps. At the first step, we should select an appropriate 
transmission path, the direct path or ordinary path, for each 
flow. Specially, direct transmissions should be enabled if there 
are direct paths of high channel quality. At the second step, 
we should exploit concurrent transmissions fully to improve 
transmission efficiency as much as possible. Following the 
above ideas, in this section, we propose a path selection crite¬ 
rion to decide transmission paths for flows, and a transmission 
scheduling algorithm to compute schedules to accommodate 
the traffic demand of flows with as few time slots as possible. 


A. The Path Selection Criterion 

Intuitively, for each flow, if its direct path has high channel 
quality, we should enable its direct transmission other than 
transmissions through its ordinary path. For a transmission 
path p of h hops, we define its transmission capability as 


A(p ) 


1 



( 21 ) 


where Cj is the transmission rate of the jth hop on p. 
Therefore, for each flow, we should choose the path with 
higher transmission capability between its direct path and 
ordinary path. The path selection criterion can be expressed 
as 


{ if j^>P, choose pf-, y ._ (22) 

[ otherwise, choose p\. 

For flow i, pf denotes its direct path, and p\ denotes its 
ordinary path. /3 is the path selection parameter, which is larger 
than or equal to 1. The smaller /?, the higher the priority of 
direct transmissions between devices. 

For the example in section HIl-CI we apply the path selection 
criterion with /3 equal to 2, and the result is that flow A 
—> B will be transmitted through its ordinary path, and the 
other three flows will be transmitted through their direct paths, 
which is the same as the optimal solution in Section HV-BI 

B. The Transmission Scheduling Algorithm 

After selecting transmission paths for flows, we propose a 
heuristic transmission scheduling algorithm to accommodate 
the traffic demand of flows with as few time slots as possible 
by exploiting spatial reuse fully. Since adjacent links cannot 
be scheduled concurrently in the same stage, links scheduled 
in the same stage should be a matching, and the maximum 
number of links in the same stage can be inferred to be 
|_n/2j |fl8l l. On the other hand, due to the inherent order 
of transmissions on each path, preceding hops should be 
scheduled ahead of subsequent hops since nodes behind on 
the path are able to relay the packets after receiving the 
packets from preceding nodes. Borrowing the design ideas of 
greedy coloring (GC) algorithm in FDMAC fT8l . we perform 
edge coloring on the first unscheduled hops of paths for all 
flows. After each coloring, we remove scheduled hops, and the 
set of first unscheduled hops is updated since the hops after 
these scheduled hops become the first unscheduled hops now. 
Our algorithm schedules the first unscheduled hops of flows 
iteratively into each stage in non-increasing order of weight 
with the conditions for concurrent transmissions satisfied. To 
maximize spatial reuse, our algorithm terminates scheduling 
in each stage when no possible link can be scheduled into this 
stage any more, i.e., when all possible links are examined, or 
the number of links in this stage reaches [rz/2J. Our algorithm 
carries out this process iteratively until all hops of flows are 
properly scheduled. 

For each flow i, we denote its selected transmission path by 
the path selection criterion by pi. We denote the number of 
nodes including APs and WNs by n, and the set of selected 
paths for all flows by P, which includes the transmission path 
Pi of each flow i. The set of hops in P is denoted by H. 
For each hop of each flow’s transmission path, we define the 
number of time slots to accommodate its traffic as the weight 
of this hop. For the jth hop of flow i, its weight is denoted 
by Wij, and we denote this link by (.Sij , ) with transmission 

rate c^ . The sequence number of the first unscheduled hop on 
path pi is denoted by F u (pi). In the fth stage, the set of paths 
that are not visited yet is denoted by P*. We denote the set of 
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directional links of the tth stage by iP, and the set of vertices 
of the links in fP by V 1 . 

The pseudo-code of our transmission scheduling algorithm 
is presented in Algorithm!]] In the initialization, we obtain the 
set of transmission paths of flows selected by the path selection 
criterion, P. From P, we also obtain the set of hops, H, and 
their weights. Since we should start scheduling the first hop 
of each path, F u (pi) for each path pi is set to 1 . As indicated 
in line 1, our algorithm iteratively schedules each hop of H 
into each stage until all hops in H are scheduled successfully. 
In the beginning of each stage, all paths in P are unvisited, as 
in line 4. In each stage, our algorithm iteratively visits each 
possible hop until all possible hops are visited, or the number 
of links in the stage reaches [tz/ 2J, as indicated by lines 5- 
24. In line 6 , our algorithm obtains the set of first unscheduled 
hops of unvisited paths, denoted by if*, and obtains the hop 
in fT* with the largest weight in line 7, (siF u ( Pi ), FiF u { Pi ))- 
Then our algorithm checks whether this hop is adjacent to the 
links already in this stage, as indicated by line 8. If it is not, 
this hop will be selected as the candidate link, and be added 
to this stage in lines 9-10. Then the condition for concurrent 
transmissions of each link in this stage is checked, as indicated 
by lines 11-16. If any of links in this stage cannot support its 
transmission rate, the candidate link will be removed from this 
stage, as indicated by lines 13-14 and 20-21. If the concurrent 
transmission conditions of links in this stage are satisfied, the 
newly added link will be removed from H, as indicated by 
line 17. The number of time slots of this stage is updated to 
accommodate the traffic of this newly added link, as indicated 
by line 18. The sequence number of first unscheduled hop of 
the path, where the newly added link is, will increase by 1 , 
as indicated by line 19. The path where this candidate link is 
will be removed from the set of unvisited paths whether this 
link is added to this stage or not, as indicated by line 23. The 
scheduling results for each stage are outputted in line 25. 

We apply this scheduling algorithm to the example in 
Section IIV-B1 and obtain the schedule as follows: in the first 
stage, the first hop A —> AP2 of flow A —> B, link B —>- 
C, and link D —> API transmit for three time slots; in the 
second stage, the second hop AP2 —>- AP3 of flow A —> B 
transmits for two time slots, and link API —>- B for three 
time slots; in the third stage, the third hop AP3 —> B of 
flow A —> B transmits for three time slots. The total number 
of time slots of this schedule is the same as that obtained 
by using optimization softwares. However, our algorithm has 
the computational complexity of 0(n 4 ), which is a pseudo¬ 
polynomial time solution and can be implemented in practice. 

Remark 1: In D2DMAC, only a small amount of control 
information should be transmitted from the central controller 
(APs and WNs) to the APs and WNs (the central controller). In 
the scheduling phase, a small amount of control data regarding 
traffic demand of WNs and APs should be collected by APs 
and transmitted to the central controller through the backhaul 
network. As in ED, with the Gbps transmission rate, the traffic 
demand of WNs can be collected by APs in a few time slots. 
Similarly, with the optimized backhaul network, the central 
controller is able to collect the traffic demand information 
from APs in a few time slots. After schedule computation, the 


Algorithm 1 The Transmission Scheduling Algorithm 

Initialization: 

Input: the set of selected paths of all flows, P; 

Obtain the set of hops in P, denoted by H\ 

Obtain the weight of each hop () £ H, Wij\ 

Set F u (pi) = 1 for each p, : £ P; t= 0; 

Iteration: 

l: while (\H\ > 0) do 
2 : t=t+l; 

3: Set V* = 0, iP = 0, and (5* = 0; 

4: Set P* with P* = P; 

5 : while (|P‘| > 0 and |iP| < |n/2j) do 

6 : Obtain the set of first unscheduled hops of P*, / /*; 

7: Obtain the hop (siF n ( Pi ), ^iF u ( Pi )) £ H f u with the 

largest weight; 

8: if ( s iFu ( Pi ) £ V* and r iFu ( Pi ) i V*) then 

9: ip = jp u {(SiF u ( Pi ), r iF u ( P i))}', 

10 : V* = V* U {SiF u ( Pi ), OF„(pi)} ; 

ll: for each link , r , :j ) in fP do 

12 : Calculate the SINR of link (Sy. ry), 

SINR Sijrij 

13: if ( SINR Sijrij < MS {a,)) then 

14: Go to line 20 

15: end if 

16: end for 

H: H = H {(SiF u ( P i) i G-Fufpi))}’ 

18: <5* = max{P, WiF u ( Pi )}\ 

19: F u (pi) = F u (pi) + 1; Go to line 22 

20 ; FI iP { i^iF u ( P i) 1 

-l- V V {SiF u (p^) ; TiF u ( P i )}, 

22 : end if 

23: P t u =P t u -p i \ 

24: end while 

25: Output fP and P; 

26: end while 


central controller should push the schedule to APs and WNs. 
In the same manner, the central controller is able to push a 
small amount of control data regarding the schedule to APs 
and WNs in a few time slots. The number of time slots in the 
traffic polling or schedule pushing depends on the number of 
APs and WNs and the transmission rates of links between the 
central controller and APs and links between APs and WNs. 

Remark 2: Retransmission is an important mechanism to 
improve the robustness of transmission. Retransmissions can 
be enabled in D2DMAC. In the traffic polling part in the 
scheduling phase, the receivers (APs or WNs) are able to 
collect the failed transmissions in the last frame, and if one 
failed packet needs to be retransmitted and its retransmission 
times do not exceed the maximum limit, the receiver will 
report the failed packet to the central controller. Then the 
central controller will schedule the retransmission of the failed 
packet in the current frame. 

VI. Performance Analysis 

The conditions for concurrent transmissions determine the 
links scheduled concurrently in the same stage in D2DMAC. 
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Thus, they have an important impact on the scheduling ef¬ 
ficiency of D2DMAC, and we should study them further 
for a better understanding of D2DMAC. In this section, we 
give a theoretical performance analysis by presenting the 
conditions for the concurrent transmissions in the same stage 
in D2DMAC. In the next section, we demonstrate the superior 
performance of D2DMAC by extensive simulations under 
various traffic patterns, and comparison with the Optimal 
Solution and other existing protocols. 

Suppose there are M nonadjacent links, (si,ri), (S 2 H 2 ), 
... , ( Sm,tm), where M is less than or equal to \n/ 2J fT8l . 
To enable concurrent transmissions of the M links, the SINR 
of each link should be larger than or equal to the minimum 
SINR to support its transmission rate. For link (si,ri), its 
SINR should meet 


k^Ptlsiri 


M 

WN 0 + P £ fs^koPtls 
m =2 


> MS{c Siri ). (23) 


-7 


It can be converted to 

M 

JSm,r i‘s m n 


E /wi**»rr 7 < ( fc °fc/, iri ] -WN 0 )/{pk 0 P t ), (24) 


m —2 


MS(c siri ) 


which can be regarded as the Spatial Reuse Region of link 
(si,ri) and is denoted as SRR\. If l S2ri , l S3ri , ■■■ , h M n are 
independent and identically distributed random variables with 
probability density function f(x), then the probability of link 
(si, ri) to support transmission rate c Siri with the interference 
from (s 2 ,r 2 ), ... , (.SmHm) considered is 


-Dsiri — 


ISRRi 


/(u)-/(; 


SMri)dls 2 ri ' ' ' dl SM ri- 

(25) 


Similarly, the Spatial Reuse Regions of links (S 2 H 2 ), , 

(smjJ'm) can a ls° be obtained and are denoted as SRR 2 , 
... , SRRm■ The probability of these links to support their 
transmission rates can also be obtained and is denoted as 
B S2r2 , B S3r3 , ... , B SM . rM . Therefore, the probability of 
concurrent transmissions of (si,n), (s 2 ,f 2 ), ••• , (smHm ) 
can be calculated as 


°siri,s 2 r 2 ,-,SMrM — B SlVl B S2T2 ■ • • B SMTM . (26) 

For link (si,rr), if there are F Siri links that have interfer- 

M 

ence to link (si,rr), i.e. fs m ,r i = Fain > then a sufficient 

m =2 

condition for (si,rr) to support transmission rate c Siri can 
be inferred from (El and is illustrated as follows. For each 
link (s m ,r m ) of the F Siri links, if the distance between its 
transmitter s m and node n, l 8m n meets 


l Sm n > (pk 0 PtF Siri ) 1 /y( k ^ tlsiri 7 - WN 0 ) (27) 

the SINR of (si,ri) will be able to support its transmission 
rate, and the right side of (l27l) can be seen as the interference 
radius of link (si,tt). The distances between the transmitters 
of the F Siri links and receiver n should be larger than or 
equal to the interference radius to support the transmission 


rate of (si,ri). Similarly, the interference radiuses of other 
links can also be calculated. 

In Fig. Q] we plot the interference radii of link (V|, r t ) under 
different numbers of interference links. The bandwidth W is 
set to 1760 MHz, and l Siri is set to 2 m . p is set to 1, and 
the transmit power P t is set to 0.1 mW. Other parameters 
are the same as those in Table I of Ref. J30J. In Fig. a ux 
we compare three modulation and coding schemes in IEEE 
802.llad, QPSK with LDPC codes of coding rate 1/2, 3/4, 
and 7/8 respectively. According to their BER performance in 
Ref. ED, their minimum SINRs to support transmission rates 
of 1760 Mbps, 2640 Mbps, and 3080 Mbps are 5 dB, 8 dB, 
and 10 dB respectively. The path loss exponent is set to 2. 
From the results, we can observe that the interference radius 
increases with the number of interference links, F siri . The 
more interference sources, the farther they should be located 
relative to receiver r\ to reduce interference. On the other 
hand, the interference radius also increases with the minimum 
SINR to support c Siri , MS(c Siri ). When c Siri increases, 
higher SINR is required, and the interference sources should 
be located farther relative to receiver r± to avoid severe 
interference. 

In Fig. [T] (b), we compare the interference radii of QPSK 
with LDPC codes of coding rate 1/2 under different path loss 
exponents. As we can see, the interference radius decreases 
with the path loss exponent. With a higher path loss exponent, 
the interference power decreases more rapidly, especially with 
more interference sources, and thus interference sources can 
be located nearer to receiver n. 



Fig. 4. Interference radii under different numbers of interference links. 

In Fig. \5] we compare the interference radii of QPSK with 
LDPC codes of coding rate 7/8 under different link lengths 
and transmit power. From Fig. [5] (a), we can observe the 
interference radius increases with the link length l siri signif¬ 
icantly. The longer the link length, the less the signal power 
received. Thus, the interference sources should be located 
farther relative to receiver r\. Therefore, in the user densely 
distributed scenario, the link length is shorter on average, and 
the interference radius will also be shorter, which indicates the 
concurrent transmission conditions can be satisfied more easily 
in this case. From Fig. [5] (b), we can observe the interference 
radius decreases with the transmit power. With more transmit 
power, more signal power is received, and thus interference 
sources can be located nearer to receiver r 1 . 

VII. Performance Evaluation 

In this section, we give an extensive performance evalu¬ 
ation for our proposed joint scheduling scheme, D2DMAC, 
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(a) Different link lengths (b) Different transmit power 

Fig. 5. Interference radii under different numbers of interference links. 


under various traffic patterns. Specifically, we compare its 
performance with the optimal solution and other protocols, 
and analyze its performance with different path selection 
parameters. 


A. Simulation Setup 


In the simulation, we consider a typical dense deployment 
of small cells in the 60 GHz band 0, where nine APs are 
uniformly distributed in a square area of 50 m x 50 m, and 
the gateway is located at the center point. There are 30 WNs 
uniformly distributed in this area, and each WN is associated 
with the nearest AP. We adopt the simulation parameters in 
Table II of Ref. IfTTl . which are listed in Table Q] We set the 
duration of one time slot to 5 ps and the size of data packets 
to 1000 bytes {H. We set three transmission rates, 2 Gbps, 4 
Gbps, and 6 Gbps according to the distances between devices. 
For links of the backhaul network, we set their transmission 
rates to 6 Gbps due to their better channel conditions. With a 
transmission rate of 2 Gbps, a packet can be transmitted in a 


time slot ED- The AP can access , T _ hp +2-t S ifs+Tack 
nodes in one time slot in lfT8l . With a small cell of it) 
nodes, the AP is able to complete the traffic demand polling 
stage or the schedule pushing in one time slot. Similarly, 
with the optimized backhaul network, the control information 
transmission between the central controller and APs can also 
be completed in a few time slots. Thus, for the simulated 
network, the central controller is able to complete the polling 
of traffic or schedule pushing in a few time slots. Generally, 
it takes only a few time slots for the central controller to 
complete schedule computation. The simulation length is set 
to 0.5 seconds. The delay threshold is set to 10 4 time slots, and 
packets with delay larger than the threshold will be discarded. 
Initially, each flow has a few packets randomly generated 
to transmit. In the simulation, we assume nonadjacent links 
are able to be scheduled for concurrent transmissions. Since 
retransmission is not the focus of this paper, we do not 
consider it in the simulations. 

In the simulation, we set two kinds of traffic modes, the 
Poisson Process and Interrupted Poisson Process (IPP). 

1) Poisson: packets arrive at each flow following the Pois¬ 
son Process with arrival rate A. The traffic load in Poisson 
traffic, denoted by T), is defined as 


Ti 


A x L x N 
R 


(28) 


where L is the size of data packets, N is the number of flows, 
and R is set to 2 Gbps. 


TABLE I 

Simulation Parameters 


Parameter 

Symbol 

Value 

PHY data rate 

R 

2Gbps, 4Gbps, 6 Gbps 

Propagation delay 

Sp 

50ns 

Slot Duration 

Tslot 

5 n s 

PHY overhead 

TpHY 

250ns 

Short MAC frame Tx time 

Tsh.Fr 

Tp h y 14* 8/R+(5p 

Packet transmission time 

Tpacket 

1000*8/R 

SIFS interval 

TsiFS 

100ns 

ACK Tx time 

Tack 

TshFr 


2 ) IPP: packets arrive at each flow following an Interrupted 
Poisson Process (IPP). The parameters are Ai, A2, pi, and p 2 - 
The arrival intervals obey the second-order hyper-exponential 
distribution with a mean of 


E{X) = Y + Y- 

M A2 

The traffic load Tj in this mode is defined as 

Lx N 

rp _ _ 

~ E{X) x R 


(29) 


(30) 


We evaluate the system performance by the following four 
metrics. To explain them clearly, we denote the set of packets 
transmitted in the simulation for each flow i as .S',. For each 
packet e in Si, we denote its delay in units of time slots as 
y e . We also denote the delay threshold as TH. 

1) Average Transmission Delay: The average transmission 
delay of received packets from all flows; we evaluate it in units 
of time slots, which can be expressed as 


N 

E E 2/e 

Average Transmission Delay =———-—. (31) 

E \Si\ 

i=l 

2) Network Throughput: The total number of successful 
transmissions of all flows until the end of simulation. For each 
packet, if its delay is less than or equal to the threshold, it will 
be counted as a successful transmission. With fixed simulation 
length and packet size, it shows throughput performance well 
and can be expressed as 


N 

Network Throughput = ^ |{e|e £ Si, y e < TH}\. (32) 

i=l 

3) Average Flow Delay: The average transmission delay of 
some flows, which may be transmitted though ordinary paths 
or direct paths. In the simulation, we investigate two cases. 
In the first case, we evaluate the average transmission delay 
of flows between WNs. We denote the set of flows between 
WNs as Bw- Then Average Flow Delay in this case can be 
expressed as 

E E 2/e 

Average Flow Dclayi = IgBw eeS ' —. (33) 

.E \Si\ 

i(z Bw 

In the second case, we evaluate the average transmission 
delay of flows from or to the Internet. With the set of flows 
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from or to the Internet denoted as In, Average Flow Delay in 
the second case can be expressed as 

E E Ve 

Average Flow Delay 2 = i£In eGS ' —_ (34) 

E \Si\ 

*€ In 

4) Flow Throughput: The number of successful transmis¬ 
sions achieved by each flow until the end of simulation. In the 
simulation, we also investigate two cases. In the first case, we 
evaluate the average flow throughput of flows between WNs, 
which can be expressed as 

E \{e\e e Si, y e < TH}\ 

Flow Throughput 1 = ^^-—:-. (35) 

pw| 

In the second case, we evaluate the average flow throughput 
of flows from or to the Internet, which can be expressed as 

E |{e|e 6 Si, y e < TH}\ 

Flow Throughput2=—^-r—:-. (36) 

I^-nI 

In the simulation, we compare the performance of 
D2DMAC with the following three benchmark schemes: 

1) ODMAC: In ODMAC, device-to-device transmissions 
are not enabled, and all flows are transmitted through their 
ordinary paths. The scheduling algorithm of ODMAC is 
the same as D2DMAC. This benchmark scheme represents 
the current state-of-the-art work in terms of scheduling the 
access or backhaul without considering the device-to-deivce 
transmissions (T8), [|2Q| . ll2TI . 

2) RPDMAC'. RPDMAC selects the transmission path for 
each flow randomly from its direct path and ordinary path. 
Its scheduling algorithm is the same as D2DMAC. Thus, 
RPDMAC is a good benchmark scheme to show the advan¬ 
tages of the path selection criterion in D2DMAC. 

3) FDMAC-E. FDMAC-E is an extension of FDMAC 
ED, and to the best of our knowledge, FDMAC achieves 
highest efficiency in terms of spatial reuse. In FDMAC-E, the 
transmission path is selected the same as D2DMAC with the 
path selection parameter f} equalling to 2. However, in order to 
show the role of backhaul optimization, the access links and 
backhaul links are separately scheduled in FDMAC-E. The 
access links from WNs to APs are scheduled by the greedy 
coloring (GC) algorithm of FDMAC in fl8l . The backhaul 
links on the transmission path are scheduled by the serial 
TDMA. The access links from APs to WNs are also scheduled 
by the GC algorithm of FDMAC. 

B. Comparison with the Optimal Solution 

We first compare D2DMAC with the optimal solution of the 
MILP problem, where the path selection parameter /3 is set to 
2. Since obtaining the optimal solutions takes long time, we 
simulate a scenario of nine cells with ten users and ten flows. 
The simulation length is set to 0.025 seconds, and the delay 
threshold is set to 50 time slots. The traffic load is defined the 
same as in Section IVII-AI with N equal to 10. 


We plot the delay and throughput comparison of D2DMAC 
and the Optimal Solution under Poisson traffic in Fig. [6] From 
the results, we can observe that the gap between the delay 
of D2DMAC and Optimal Solution is negligible when the 
traffic load does not exceed 1.5625. With the increase of 
traffic load, the gap increases slowly. In terms of network 
throughput, the gap between D2DMAC and the Optimal 
Solution is negligible. When the traffic load is 2.8125, the 
gap is only about 2.9%. Therefore, we have demonstrated that 
D2DMAC achieves near-optimal performance in some cases. 
Besides, with the path selection parameter (3 optimized, the 
gap between D2DMAC and Optimal Solution will decrease 
further. 



(a) Average transmission delay (b) Network throughput 
Fig. 6. Delay and throughput comparison of Optimal Solution and D2DMAC. 


We also plot the average execution time of D2DMAC and 
Optimal Solution under Poisson traffic in Fig. [7] We can 
observe that Optimal Solution takes much longer execution 
time than D2DMAC, and the gap increases with the traffic 
load, which indicates D2DMAC has much lower computa¬ 
tional complexity. 



Fig. 7. Execution time comparison of Optimal Solution and D2DMAC. 

C. Comparison with Other Protocols 

We plot the network throughput of D2DMAC, RPDMAC, 
ODMAC, and FDMAC-E under different traffic loads in Fig. 
[8] where the path selection parameter S is also set to 2. 
As we can observe, under light load from 0.5 to 1.5, the 
throughput of the four protocols is almost the same. However, 
in terms of the throughput, ODMAC starts to drop at the 
traffic load of 1.5, RPDMAC starts to drop at the traffic 
load of 2, and FDMAC-E starts to drop at the traffic load 
of 4.5. Conversely, the throughput of D2DMAC increases 
linearly with the traffic load. Under Poisson traffic, D2DMAC 
outperforms RPDMAC by more than 7x at the traffic load of 5. 
Since direct transmissions between devices are not enabled in 
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ODMAC, the gap between D2DMAC and ODMAC is even 
larger than that between D2DMAC and RPDMAC, which 
indicates device-to-device transmissions can improve network 
throughput significantly. The throughput of D2DMAC and 
FDMAC-E starts to diverge at the traffic load of 2.5, and 
the gap increases with the traffic load. When the traffic load 
is 5, D2DMAC increases the network throughput by about 
55.8% compared with FDMAC-E under Poisson traffic. Since 
the transmission paths of FDMAC-E are the same as those 
of D2DMAC, the gap is caused by the joint scheduling of 
access and backhaul links in D2DMAC. Therefore, it also 
demonstrates the great benefits of the joint scheduling of 
the access and backhaul networks in terms of enhancing the 
system performance. 



(a) Poisson traffic (b) IPP traffic 

Fig. 8. Network throughput of four protocols under Poisson and IPP traffic. 


We further plot the average flow throughput of four pro¬ 
tocols for IPP traffic under different traffic loads in Fig. [9] 
These results are consistent with those in Fig. [8] and indicate 
D2DMAC improves the flow throughput of flows both between 
devices and from or to the Internet significantly compared with 
RPDMAC, ODMAC, and FDMAC-E. 



(a) Flows between WNs (b) Flows from or to the Internet 
Fig. 9. Average flow throughput of four protocols under IPP traffic. 


To analyze the impact of user (WN) distribution density on 
the performance of D2DMAC, we investigate five cases of 
user deployment, i.e., 20, 25, 30, 35, and 40 WNs uniformly 
distributed in a square area of 50 m x 50 m. The traffic load 
is set to 4. In Fig. [TOj we plot the network throughput of four 
protocols with different number of WNs. From the results, 
we can observe that D2DMAC outperforms the other three 
protocols significantly. The network throughput of D2DMAC 
increases with the number of WNs. The reason is that with the 
increase of WN, the WN distribution density increases, and the 
average distance between nodes decreases. In this case, since 
the backhaul network does not change, there are more flows 
transmitted through the direct transmission path. Furthermore, 
the transmission rates of the direct transmission paths also in¬ 
crease due to shorter link length. Thus, the network throughput 


of D2DMAC increases when the number of WNs increases. 
As stated before, the gap between D2DMAC and FDMAC-E 
also indicates the advantages of joint scheduling of access and 
backhaul networks in D2DMAC. 



Fig. 10. Network throughput of four protocols under different number of 
WNs. 


In Fig. |TT] we also plot the average flow throughput of four 
protocols under different number of WNs. From the results, 
we can observe that they are consistent with those in Fig. 
m which demonstrates D2DMAC has better utilization of 
the device-to-device transmissions to improve network perfor¬ 
mance under the user densely distributed scenario. Combining 
the results of Fig. 0(a), we have demonstrated that D2DMAC 
has superior performance in the user densely distributed sce¬ 
nario of the next generation mobile broadband. 



(a) Flows between WNs (b) Flows from or to the Internet 
Fig. II. Average flow throughput of four protocols under IPP traffic. 


D. Performance under Different Path Selection Parameters 

We evaluate the performance of D2DMAC under different 
path selection parameters. We investigate four cases, with /3 
equal to 1, 2, 3, 4, and 5 respectively. For simplicity, we 
denote these cases by D2DMAC-1, D2DMAC-2, D2DMAC-3, 
D2DMAC-4, and D2DMAC-5. 

1) Delay: We then evaluate the average transmission delay 
of D2DMAC with different path selection parameters under 
different traffic loads in Fig. [12] From the results, we can 
observe that with the increase of traffic load, the delay of 
these protocols increases, and /3 has a big impact on the 
performance of D2DMAC. As we can observe, D2DMAC- 
2 achieves the best delay performance. When /3 decreases 
from 5 to 2, the delay becomes better. However, when 3 is 
equal to 1, its delay becomes worse. With /3 becomes smaller, 
the priority of device-to-device transmissions becomes higher. 
However, there is also some cases where transmission through 
the backhaul network outperforms device-to-device transmis¬ 
sion. For example, when the distance between two devices is 
large, device-to-device transmission between them may be not 
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optimal. Besides, transmissions through the backhaul network 
usually have multiple hops, which may increase the probability 
of concurrent transmissions for hops from different flows. 
Therefore, in practice, /3 should be optimized according to 
the actual network states and settings. 



Traffic Load Traffic Load 


(a) Poisson traffic (b) IPP traffic 

Fig. 12. Average transmission delay of D2DMAC with different 0. 

We also plot average flow delay of D2DMAC with different 
path selection parameters under IPP traffic in Fig. [13] From 
the results, we can observe that D2DMAC with /3 equal to 2 
outperforms D2DMAC in other cases for flows both between 
WNs and from or to the Internet. Therefore, the path selection 
parameter should be optimized to achieve low flow delay. 



J 1 l.J Z..J D J.J ‘t H.J J U.J 1 l.J i. i.J J J.J H ‘t.J J 

Traffic Load Traffic Load 

(a) Flows between WNs (b) Flows from or to the Internet 


Fig. 13. Average flow delay of D2DMAC with different 0. 

2) Throughput: We plot the network throughput of 
D2DMAC with different different path selection parameters 
under different traffic loads in Fig. [14] We can observe that the 
throughput is consistent with the delay in Fig. [12] Under light 
traffic load, throughput increases with the traffic load. When 
the traffic load exceeds some critical point, which is different 
for different j3, the throughput starts to decrease. For example, 
the critical point for D2DMAC-3 is 4. This can be explained 
as follows. Delay increases with the traffic load, and when 
traffic load exceeds the critical point, the network becomes 
congested. In this case, the delay of a considerable number 
of packets exceeds the threshold, and thus these packets are 
discarded as failed transmissions, which leads to the abrupt 
drop in the throughput curve. We can also observe that the 
critical point for D2DMAC-2 is larger than 5, which indicates 
its better performance under heavy load. 

We also plot the average flow throughput of D2DMAC with 
different path selection parameters under Poisson traffic in Fig. 
n~5l The results indicate the importance of choosing a suitable 
path selection parameter for flows both between WNs and 
from or to the Internet. 

In conclusion, through extensive simulations, we demon¬ 
strate D2DMAC achieves near-optimal performance in some 
cases in terms of delay and throughput with low complexity. 



(a) Poisson traffic fb) IPP traffic 

Fig. 14. Network throughput of D2DMAC with different 0. 




(a) Flows between WNs (b) Flows from or to the Internet 
Fig. 15. Average flow throughput of D2DMAC with different 0. 

and outperforms other protocols significantly. Furthermore, 
results under different path selection parameters suggest the 
path selection parameter should be optimized in practice to 
maximize performance improvement. 

VIII. Conclusion 

In this paper, we consider the optimal joint scheduling 
problem of access and backhaul networks for small cells in 
the mmWave band. We propose a centralized MAC schedul¬ 
ing scheme, D2DMAC, where direct transmissions between 
devices, i.e., device-to-device transmissions, are enabled for 
performance improvement. In D2DMAC, we propose a path 
selection criterion to decide whether device-to-device trans¬ 
mission should be enabled for each flow, and propose a 
transmission scheduling algorithm to explore the potential of 
concurrent transmissions for maximizing spatial reuse gain. Fi¬ 
nally, through extensive simulations, we evaluate and analyze 
the delay and throughput of D2DMAC with different path se¬ 
lection parameters. Comparing D2DMAC with other protocols 
shows device-to-device transmissions and joint scheduling of 
access and backhaul networks can greatly improve network 
performance. Furthermore, comparison between D2DMAC 
and the optimal solution demonstrates D2DMAC achieves 
near-optimal performance in some cases in terms of delay 
and throughput. In the future work, we will develop the dis¬ 
tributed version of D2DMAC and also consider incorporating 
the NLOS transmission and retransmissions into D2DMAC. 
Besides, we will also consider fading in the model, and 
investigate the impact of fading on the network performance. 
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